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Abstract 

Recent work has suggested that in highly correlated systems, such as sandpiles, 
turbulent fluids, ignited trees in forest fires and magnetization in a ferromagnet close 
to a critical point, the probability distribution of a global quantity (i.e. total energy 
dissipation, magnetization and so forth) that has been normaUzed to the first two mo- 
ments follows a specific non Gaussian curve. This curve follows a form suggested by 
extremum statistics, which is specified by a single parameter a (a = 1 corresponds to 
the Fisher-Tippett Type I ("Gumbel") distribution.) 

Here, we present a framework for testing for extremal statistics in a global ob- 
servable. In any given system, we wish to obtain a in order to distinguish between 
the different Fisher Tippett asymptotes, and to compare with the above work. The 
normalizations of the extremal curves are obtained as a function of a. We find that for 
realistic ranges of data, the various extremal distributions when normalized to the first 
two moments are difficult to distinguish. In addition, the convergence to the limiting 
extremal distributions for finite datasets is both slow and varies with the asymptote. 
However, when the third moment is expressed as a function of a this is found to be a 
more sensitive method. 
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1 Introduction 



The study of systems exhibiting non Gaussian statistics is of considerable current interest (see 
e.g. Somette (2000) and references therein). These statistics are often observed to arise in finite 
size many body systems exhibiting correlation over a broad range of scales; leading to emergent 
phenomenology such as self similarity and in some cases fractional dimension (Bohr et al., 1998). 
The apparent ubiquitous nature of this behavior has led to interest in self organized criticaUty 
(Bak, 1997; Jensen, 1998) as a paradigm; other highly correlated systems include those exhibiting 
fully developed turbulence. In solar terrestrial physics in particular, problems of interest include 
MHD turbulence in the solar wind and in the earth's magnetotail. Irregular or bursty transport 
and energy release in the latter has recently led to complex system approaches such as SOC (see 
the review by Chapman and Watkins, Space. Sci. Rev., 2001). These complex systems are often 
characterized by a lack of scale, and in particular, by the exponents of the power law probabiUty 
distributions (PDF) of patches of activity in the system. Examples of these patches of activity are 
energy dissipated by avalanches in sandpiles, vortices in turbulent fluids, ignited trees in forest 
fires and magnetization in a ferromagnet close to the critical point. In the earth's magnetotail, 
patches of activity in the aurora as seen by POLAR UVI have been used as a proxy for the energy 
released in bursty magnetotail transport in order to infer its scaling properties (Lui et al., 2000; 
Uritsky et al., 2001). The challenge is to distinguish the system from an uncorrelated Gaussian 
process, by demonstrating self similarity; and to determine the power law exponents. To do this 
directly is nontrivial, requiring measurements of the individual patches or activity events over 
many decades. Here we consider what may be a more readily accessible measure, the statistics of 
a global average quantity such as the total energy dissipation, magnetization and so forth. 

An important hypothesis that is the subject of this paper is that the data arise from an extremum 
process; i.e. that some unknown selection process operates such that the observed global quantity 
is dominated by the largest events selected from ensembles of individual 'patches' of activity. This 
is a real possibiUty for two reasons. First, measurements of physical systems, and in particular, 
observations of natural systems, inevitably incorporate instrumental thresholds and this may affect 
the statistics of a global quantity comprising activity summed over patches. Second, there has 
recently been considerable interest in a series of intriguing results from turbulence experiments 
(Labbe et al., 1996; Pinton et al., 1999; Bramwell et al., 1998), and numerical models exhibiting 
correlations (Bramwell et al. (2000), but see also Aji and Goldenfeld (2001); Zheng and Trimper 
(2001); Bramwell et al. (2001)). These studies reveal statistics of a global quantity (i.e. E) that 
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follows curves that are of the form of one of the limiting extremal distributions: (Gumbel, 1958; 
Fisher and Tippett , 1928) 

P{E) = K{ey-^')'' 

y = b{E-s) (1) 

where K, b and s are obtained by normalizing to the first two moments (Mq = 1, Mi = 0, 
M2 = 1), and the single parameter a appears to be close to the value tt/2. 

For an infinitely large ensemble, there are two limiting distributions that we consider here. The 
Fisher-Tippett type 1 (or 'Gumbel') extremal distribution is of the form (1) but with a = I and 
arises from selecting the largest events from ensembles with distributions that fall off exponentially 
or faster. Since we wish to construct a framework that could encompass all highly correlated 
systems we also treat the case where the distribution of 'patches' is power law. An example is 
the Potts model (Cardy, 1996) for magnetization where connected bonds form clusters, the size of 
which is power law distributed at the critical point. In this case the relevant extremal distribution 
is Fisher-Tippett type n (or 'Frechet'). 

Here we provide a framework for comparing data with Fisher Tippett type I and n extremal 
curves. This essentially requires obtaining the normalizations of these curves in terms of the 
moments of the data and ultimately as functions of the single parameter a. 

We find that the curves of form (1) which are obtained by normalizing to the first two moments 
are difficult to distinguish if a is in the range [1, 2] or from Frechet curves given a realistic range 
of data. Furthermore we demonstrate that slow convergence with respect to the size of the dataset, 
to the limiting a = 1 extremal distribution has the consequence that, for a large but finite ensem- 
ble, the extremal distribution of an uncorrelated Gaussian process is indistinguishable from the 
a = 7r/2 curve. To overcome these limitations we suggest two much more sensitive methods for 
determining whether or not the curve is of the form (1), and, if so, the corresponding value of a. 
These methods are based on the third moment, and the peak of the distribution, both of which we 
obtain here as a function of a. 

2 Extremum statistics: general results. 

To facilitate the work here we first develop some results from extremum statistics (for further 
background reading see Sornette (2000); Gumbel (1958); Bouchaud and Potters (2000)). If the 
maximum Q* drawn from an ensemble of M patches of activity Q with distribution N{Q) is 
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Q* = max{Qi, ..Qm}, then the probability distribution (PDF) for Q* is given by 

P^iQ*) = MN{Q*){1 - iV>(Q*))^-i (2) 

where M is the number of patches in the ensemble and 

N>{Q*) = / N{Q)dQ (3) 
Jq* 

We now obtain for large M, Q. For general PDF N{Q) we can write (for appropriate choice 
of the function g{Q*)): 

(1 - A^>)^ = e-^^(Q*) (4) 
and for small iV> {Q*) we have 

g{Q*) = - ln(l - iV>(Q*)) - iV> + ^ (5) 
We now consider a characteristic value of Q*, namely Q*, such that by definition 
Mg{Q*) = q (6) 
so that 

q = Mg{Q*)f^MNy{Q*)+M ' + ■ ■ ■ (7) 

We now expand g{Q*) about Q* to obtain 

gm = gm+g'm^Q* + ^^(aq*)^ + . . . (8) 

and from (5) we have 

g'{Q*) = -N{Q*) - iV(Q*)7V> + . . . (9) 
g"iQ*) = -N'iQ*) - N'iQ*)N> + N^{Q*) + ■■■ (10) 

where g',g" denote differentiation with respect to Q*, AQ* = Q* — Q*, and we have used 
= dN^/dQ* = —N. Inverting expansion (7) gives 



MN>{Q*)=q 



M(l-e-M^ (11) 



We obtain from (5) and its derivatives with respect to Q*: 



+ 0(^)' (12) 



which to relevant order is consistent with (6), and 



M 



(13) 



For q finite as M ^ oo this gives g'{Q*) = -N{Q*) and MN>{Q*) = q. 

We can now consider the extremal statistics of specific PDF N{Q), and importantly show that 
PmiQ*) can be written in the universal form (1). 

2. 1 Gaussian and Exponential N{Q) 

If N{Q) falls off sufficiently fast in Q, i.e. is Gaussian or exponential it is sufficient to consider 
lowest order only in (5) giving g{Q*) ^ (Gumbel, 1958; Bouchaud and Mezard, 1997) and 
q = MN^{Q*). Expanding (3) in Q* near Q* gives to this order: 



MN^iQ*) = M 
MN{Q* 



N{Q)dQ-MN{Q*)AQ 
AQ* + --- ^qe 



(14) 



Expanding N{Q) about Q* yields 



N{Q*) = N{Q* 



N{Q*) 



N{Q*)e^(Q^^ 



(15) 



As to this order (1 - N>)^-^ ^ (,-mn> ^^^^ jj^^g ^^^^ ^2) 



Pm{Q*) = MN{Q*){l-N>{Q*)) 



*\\M-l 



MN{Q*)e-^^> 



(16) 



with 



a = — - 



and 



N'{Q*)N>{Q* 



(17) 



,=jMim]^imAQ* (18) 

Since throughout we are considering Q* large (M oo,q finite) we have the effective value of 
a as that given by (17) in the Umit Q* oo. For N{Q) exponential the above gives a = 1. In the 
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particular case of the exponential all the summations which in the above we have truncated can be 
resummed exactly and give a = 1, recovering the result of Bouchaud and Mezard (1997). 

For N{Q) Gaussian we cannot obtain a exactly in this way but as we shall see it is instructive 
to make an estimate. Given N{Q) = Nq exp(— AQ^) and expanding equations (14), (15) and (16) 
to next order we obtain 



P — P pR{u) 



(19) 



where we have used u = —2XQ*AQ* and u = u + ln(g). To lowest order in AQ*/Q* (i.e. 
Q* oo) we have a universal PDF with a = 1, but to next order, that is, neglecting only the term 
in in (19) we have a universal distribution of form (1,16) with 



2.2 Power law N(Q) 



(20) 



The PDF of patches N{Q) may however be a power law and in this case it will fall off sufficiently 
slowly with Q that we need to go to next order as in (7). If we consider a normalizable source PDF 



then for large Q (Q >> 1) we have N(Q) ~ Nq/Q'^^ and then using (3) and (7) 
Q*N{Q*) = {2k - l)iV>(Q*) = {2k - 1)|:(1 - ^) 



(21) 



(22) 



which with the above general expressions for g{Q*) and its derivatives substituted into (8) gives 
an expression for g{Q*) 



g{Q*) = -1- 



AQ* 



AQ* 



I -{2k- 1)^^ + k{2k - 1)( 

Q* Q 



(23) 



We also require an expression for N{Q*), again expanding about Q* and obtaining the derivatives 
of N{Q*) from those of g{Q*) and via (11) gives 



N{Q*) = N{Q 
which can be rearranged as 

N{Q*) = N{Q*)e 



l-2k^ + k{2k + l){^f 



(24) 



(25) 



6 



After some algebra (23) can be rearranged to give 

Mg{Q*) = qey ^ ' (26) 

These two expressions combine to finally give 

PmiQ) ^ PmiQ*) ie^'-'^r (27) 
with 

n = -ln(o) -ln(g) - (2fe- 1)^(1- ^) (28) 
^ ' ^ ^ Q* ^ 2Q* 

and 

2k 

(29) 



2A;- 1 

To lowest order, neglecting the {AQ*/Q*)'^ term (28) reduces to (18). 

Hence a power law PDF has maximal statistics (Q) which, when evaluated to next order, 
can be written in the form of a universal curve (i.e. of form (1,16)) with a correction that is non 
negligible at the asymptotes. This can be seen (Jenkinson, 1955; Bouchaud and Potters, 2000) to 
be consistent with the well known result due to Frechet where (following the notation of Bouchaud 
and Potters (2000)) if we have PDF 

^(^) - (30) 
then 

iV> ~ ^ (31) 
which we can write in the form 

Pm{x*) = iie'ir'''^'ir)^e''-ey (32) 
u = -ii \n{x*) - In (33) 

which is of universal form (1,16) in u. Noting that here jj, = 2k — I and a = (/x + l)//x and that 
to second order 

^(l-^)=lnfl + ^^ (34) 
Q* ^ 2Q*' \ Q* ) 

we simply identify 1 + A(5*/Q* with x* to obtain (28). To next order in AQ*/Q* the analogue 
of (28) still yields the right hand side of (34). 
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2.3 Convergence to the limiting distributions 



The above results should be contrasted with the derivation of Fisher and Tippett (Fisher and Tippett 
, 1928). Central to (Fisher and Tippett , 1928) and later derivations is that a single ensemble of 
NM patches has the same statistics as the N ensembles (of M patches), of which it is comprised. 
The fixed point of the resulting functional equation (Bhavsar and Barrow, 1985) for arbitrarily 
large N and M is a = 1 for the exponential and Gaussian PDF, and the Frechet result for power law 
PDF. Here, we consider a finite sized system so that although the number of reaUzable ensembles 
of the system can be taken arbitrarily large, the number of patches M per ensemble is always 
large but finite. Importantly, the rate of convergence with M depends on the PDF N{Q). For 
an exponential or power law PDF we are able to resum the above expansion exactly to obtain 
a; and convergence will then just depend on terms 0(1/M) and above. This procedure is not 
possible for N{Q) Gaussian, instead we consider the characteristic Q*, that is Q* which for M 
arbitrarily large should be large also. Rearranging (7) to lowest order for N{Q) = Nq exp(— AQ^) 
yields VXQ* ~ y^\n{M) implying significantly slower convergence. This is further discussed in 
Sornette (2000) (pp. 19-21). 

The extremal distributions are thus essentially a family of curves that are approximately of 
universal form (1,16) and are asymmetric with a handedness that just depends on the sign of Q; we 
have assumed Q positive whereas one could choose Q negative in which case N{Q) N{\ Q |). 
This would correspond to, say, power absorbed, rather than emitted, from a system. The single 
parameter a that distinguishes the extremal PDF then just depends on the PDF of the individual 
events. For N{Q) exponential we then recover exactly the well known result (Gumbel, 1958; 
Bouchaud and Mezard, 1997) a = 1. For a power law PDF a is determined by k via (29). We 
have also demonstrated that for a Gaussian PDF with finite but large M and N that 1 and will 
explore the significance of this in Section 3.1. 

3 Normalization to the first two moments 

To compare these curves with data we need P{Q) = Pm{Q*) in normaUzed form. This has 
moments 



which we will obtain as a function of a and then insist that Mq = 1, Mi = and M2 = 1. 




(35) 
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Setting Ml = (and Mq = 1, M2 = 1) in our analysis of exti-emal distributions does not 
require any assumptions about the form of the PDF except that the moments exist. It will allow us 
to write the analytically obtained extremal distributions as functions of single parameter a. 

3.1 Extremal distributions arising from Gaussian and exponential N{Q) 

For Gaussian and exponential PDF we have 

P{y)=K{e--^''r (36) 
u = h{y- s) (37) 

This has moments which converge for all n. From Appendix A we have that the v}^ moment: 

M„ = \ II FW, = if«-"'"<"'^r(a) (38, 

where rj = ln(a) — u. 

To normalize we insist that Mq = 1, Mi = and M2 = 1. The necessary integrals can be 
expressed in terms of derivatives of the Gamma function T{a) (Gradshteyn and Ryzhik (1980)) 
and we obtain in Appendix A: 



s = — 



aln(a) 

r(a) 

(*(a)-In(a)) 



K = ^ e°'"^°^ (39) 
r(a) ^ ^ 



b 

where 

^, , 1 dr{a) 



r(a) da 

* (a) = -r 

da 

The ambiguity in the sign of b (and hence s) corresponds to the two solutions for P{Q) for 
positive and negative Q. 

We can now plot the curves, that is, normaUzed to the first two moments and these are shown in 
Figure 1. Experimental measurements of a global PDF P{E) normalized to Mq would be plotted 
M2P versus {E — Mi)/M2. In the main plot we show normalized distributions of the form (1,16) 
for a = 1, 7r/2 and 2. It is immediately apparent that the curves are difficult to distinguish over 
several decades in P{y) and thus to obtain a good estimate for a, the numerical or real experiments 
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would require good statistics over a dynamic range of about 4 decades, something which is not 
readily achievable. 

On Figure 1 we have also over plotted (*) the extremal PDF of ensembles of uncorrelated num- 
bers that are Gaussian distributed, calculated numerically. We randomly select M uncorrelated 
variables Qj,j = 1, M and to specify the handedness of the extremum distribution, the Qj are 
defined negative and N{\ Q |) is normally distributed. This would physically correspond to a 
system where the global quantity Q is negative, i.e. power consumption in a turbulent fluid, as 
opposed to power generation. To construct the global PDF we generate T ensembles, that is select 
T samples of the largest negative number Q| = min{Qi..QM},i = 1, T. For the data shown in 
the figure M = 10^ and T = 10^; this gives ^/XQ* ~ ^ln(M) ~ 3 so that for the Gaussian we 
are far from the a = 1 limit (Fisher and Tippett , 1928). The numerically calculated PDF lies close 
to a = 7r/2. Such a value of a on these curves thus does not give direct evidence of a correlated 
process; in addition it is necessary to estabhsh that the data considered do not arise as the result of 
an extremal process. 

Generally, plotting data in this way is an insensitive method for determining a and thus dis- 
tinguishing the statistics of the underlying physical process. The question of interest is whether 
we can determine the form of the curve, and the value of a from data with a reasonable dynamic 
range; we address this question in section IV. 

3.2 Frechet distributions arising from power law N{Q) 

For power law PDF (21) we use the Frechet distribution which we first write as: 



which reduces to the form of (37) for AQ*/Q* < 1. From (28), (21) and (33) we identify 



The procedure of normalizing to the moments is only valid provided that they exist. For the power 
law PDF (21) we have (see also Bury (1999)): 




(40) 



(41) 



|3 = -^i = -{2k-l) 



(42) 




Which converges for Q ^ and for Q 



oo 
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which if H{Q) Hq as Q ^ oo 

ivin J Q2k-n — Q2k-n-l IQ^oo 

which converges if 2k > n + 1. 

We now evaluate the moments. Again we insist that Mq = 1, Mi = and M2 = 1 and in 
Appendix B obtain: 



a = —P In 

K = ±/?a" 



r(i + 1//?) 



r(i + |)-r2(i + i) - (43) 



r(i + i) 

Q* = - ^ 7T 



2 



r(i + §)-r2(i+i) 

where /? = —{2k — 1). The normalization constants are thus also expressible as functions of 
a = 2k/{2k - 1). 

For convergence, these curves exist for power law of index 00 > 2k > Z i.e. 1 < a < 3/2. This 
is significant since processes exhibiting intermittency as a consequence of long range correlations 
typically have k lower than this (Jensen, 1998), and we will consider alternative methods in section 
5. 

In Figure 2 we plot the normalized Fisher Tippett type 11 or Frechet PDF for /c = 2, 5, 100 
and for comparison the Fisher Tippett type 1 ('Gumbel') PDF with a = 1. From (29) o = 1 
corresponds to /c ^ 00 and it is straightforward to demonstrate from the algebra that in this limit, 
the normahzed Frechet PDF tends to Gumbel's asymptote a = 1. Hence on this plot we see that 
for A; = 100 these are indistinguishable and differences between the Frechet and Gumbel PDF 
only appear on such a plot around the mean for < 3 approximately. This demonstrates that these 
extremal curves arising from an uncorrelated Gaussian, exponential or power law N{Q) will all be 
difficult to distinguish from the curve (1,16) with a 7^ 1. We now consider more sensitive methods 
to determine a. 



4 Sensitive indicators of a; the mean and the third moment 

The question of interest is whether we can determine a with sufficient accuracy from data with a 
reasonable dynamic range. We consider two possibiUties here. 

First, a uniformly sampled process will have the most statistically significant values on the ex- 
tremal curve near the peak, and in particular, from the figures we see that the Frechet distributions 
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for small k will be most easily distinguished in this way. For the Frechet PDF the peak is at n = 0, 
that is, it has coordinates 



™ — n 



y = Q 



1 



(44) 



on the normahzed curve with K,Q,a,P known as functions of a from Appendix B. The coordi- 
nates of the peak of the PDF from the data plotted with Mq = 1, Mi = and M2 = 1 can thus be 
graphically inverted to give an estimate of a. 

For PDF that are power law with large k, exponential or Gaussian, we consider the normalized 
extremal PDF; then the coordinates of the maximum of P{y) is at u = 0, y = s, that is: 



(45) 



e" r(a) 

with K, s from (A14). These can again be graphically inverted to obtain a; Figure 3 shows P and 
y versus k for the Frechet PDF. 

A more sensitive indicator may be the third moment of P of the curve (1,16) which after some 
algebra (Appendix A) can be written as 



M3 = 



(^"(a))2 

for a Gaussian or exponential PDF i.e. with (37) and 

r(i + 3)_3r(i + 2)r(i+i) + 2r3(i+iy 



(46) 



M: 



(47) 



r(i + |)-r2(i+ 1) 

for a power law PDF (Appendix B) i.e. with (41); the latter then converging for k > 2. Again these 
refer to one of the two possible solutions for P{Q)\ the other solution corresponding Xoy ^ —y 
(Q* -Q*) in equations (37,41) which in turn gives M3 -M3. 

The third moment is plotted versus a and k respectively in Figure 4 for the Gumbel and Frechet 
curves. Inspection of Figure 4 shows that over most of the range, M3 is more sensitive than P. 
For Frechet curves, M3 only has convergence for relatively large A; (A; > 2, a < 4/3); for smaller 
k, P can distinguish the Frechet distributions (k > 3/2, a < 3/2 for convergence). 



5 A method for small k 

For N{Q) power law, we can only use the properties of the normalized Frechet PDF above for 
k > 3/2. If A; is smaller than this the second moment will not exist. We can however obtain a 
useful result for A; > 1 by using the first moment only, i.e. by insisting Mq = 1, Mi = 0. We 
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need another condition and can arbitrarily insist P{u = 0) = 1 (insisting that all the maxima of 
the Frechet PDF have the same height) which gives the condition 

Ke-'' = 1 (48) 

From B6 and B5 

KQ* 



which, with g^/^ = T{1 + 1/13) from Appendix B gives Q* in terms of a and /3 (or k). Similarly 
we use (B5); g = ae°' to obtain a in terms of a and /3. 
This then gives 



P^iQ*) = K (e^-^y 
AO* 

u = a + l3ln{l + -^) 

a = /31n(^r(l + -^)^ -ln(a) 
Q* = pe^^H^^-^) 
K = e" 



6 Conclusions 

Recent work has suggested that the probability distribution of some global quantity, such as total 
power needed to drive rotors at constant velocity in a turbulent fluid, or total magnetization in a 
ferromagnet slightly off the critical point, when normalized to the first two moments, follows a 
non-Gaussian, universal curve. This curve is of the same form as that found from the extremal 
statistics of a process that falls off exponentially or faster at large values (i.e. Fisher-Tippett type I 
or 'Gumbel'); but whereas for an extremal process the parameter specifying the curve a = 1, for 
the correlated processes a > 1. 

In this paper a framework has been developed to compare data with Fisher-Tippett type 1 ('Gum- 
bel') and type 11 ('Frechet') asymptotes by obtaining the curves, and their normalizations, as a 
function of a single parameter a. We find: 

1. The Fisher Tippett type I and type n curves and their corresponding values of a are most 
easily distinguished by considering either the third moment, or the position of the peak, as 
functions of a, the functional forms for which are given here. 
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For realistic ranges of data, simply comparing curved normalized to the first two moments as 
for example in (Bramwell et al., 1998, 2000) is insufficient to adequately distinguish either 
curves of the form of type I ('Gumbel') but with a values in the range [1,2], or most type n 
('Frechet') curves. 

2. Convergence to the limiting form of the extremal curve a = 1 (Gumbel's asymptote Fisher 
and Tippett (1928)) is sufficiently slow for an uncorrelated Gaussian that for a large but 
reahstic size of dataset one obtains a it/ 2. Data which falls on this curve is thus not 
sufficient to unambiguously distinguish a global observable of a system that has correlations 
(Bramwell et al., 1998, 2000), from that of an uncorrelated, extremal process. 

Comparison with data is then facilitated in the following way. First, the data distribution is 
normalized to Mq (to obtain the PDF N{Q) say). Second, the data is plotted on semilog axes under 
the following normalization: N{Q) XM2 versus {Q~Mi)/M2. Any Gaussian PDF on such a plot 
will fall on a single inverted parabola; similarly any Gumbel (Fisher Tipett 1) process will fall on a 
single curve. Finally, M3 is calculated for the data; we then can compare the data with an extremal 
process by inverting M3(a) obtained here for a Fisher Tipett type 1 or 11 distribution. Overlaying 
these curves (augmented by other quantitative comparisons) then essentially constitutes a fitting 
procediure; but importantly, in addition the value of a is related to the underlying distribution as 
we have discussed. 

This and related techniques will have relevance in particular for regions where transport is 
dominated by turbulence, in the solar wind and magnetosphere in circumstances where multipoint 
and long time interval in situ measurements are difficult to obtain. 

Acknowledgements. The authors would like to thank G. King, M. P. Freeman, D. Somette and J. D. Barrow 
for iUuminating discussions. SCC was supported by PPARC. 



Appendix A Moments of the Gumbel distribution and the normalization b, K and 5 as a 
function of a. 

We consider a family of curves of the form 

P{y) = Ke-"-"~" (Al) 

with u = b{y — s) where K, b, s are constants to be derived as functions of a. We write 

77 = In a — b{y — s) = In a — u (A2) 
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then ae " = and drj = —bdy, and the n*'* moment is given by 

/■~ no. ^J 1 /""^ [ln(a) + 6s - r/]" 
" = J_^y Piy)dy = ^ j_^P{y)dr) ^ ^ 

Then, using A2, we write P{y) (Al) as 
P{y) = K e-«(i^(«)-^)-e'' = i^e"''-^" 

where K = ife-°''^(«). 
Now to within a constant we can write M„ as: 

rrP{y)dr] = K / rf e'''^-^^ dr] 

-OO J —OO 

so that Mo = Mo/6. Using the substitution r = e'' A5 becomes 

/•OO _ 

Mn = K (In r)"T''-^e-^dT = iC— r(a) 
io "a" 

where r(a) is the Gamma function. Thus 

Mo = KT{a) 

Ml = Kr{a)-^{a) = Mo*(a) 
M2 = ^r(a)[*2(a) + ^''(a)] 

= Mo(^'^(a) + *'(a)) 

where 

dr{a) 1 



*(a) = 



da r(a) ' 



We now insist that Mo = 1, Mi = and M2 = 1. 
Thus 

Mo _ ma) 



° 6 6 
and 



1 /■°° 

Ml = = -2 / P{y)d'n[ln{a) + bs - rj] 
J— 00 

1 



^ [(ln(a) + 6s)Mo - Ml 



so 

^ = ln(a) + 6s = *(a) 
Mo 
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from A7. Thus 

bs = *(a) - ln(a) 
Also 

M2= 1 = p y_ P{y)dv[Ha) + bs- T]]' 



1 



(ln(a) + bsfMo - 2(lna + bs)Mi + M2 



which, using A7 and AlO rearranges to give 

Mo. 



Mo = 1 



63 



^' (a). 



This finally gives the normalisation of the universal curve 



K 



thatis K 



s = 



b 

"W) 

_^ aln(a) 

(^'(a)-ln(a)) 



(All) 



(A12) 



(A13) 



(A14) 



The above results will also yield an expression for the third moment in terms of a. Following A3 
and A5 we have 



1 

M3 = ^ y_ P{y)d7][ln(a) + bs- tjY 



^4 ^(ln(a) + bsfMo - 3(lna + bsfUi + 3(ln(a) + bs)M2 - M3 
Then A6 gives 

M3 = Mo k(a)(^'2(a) + *'(a)) + 2^'(a)^'(a) + ^"(a) 

which with A7 and AlO rearranges to give 
*"(«) 



M 



(^''(a))3/2 



(A15) 



(A16) 



(A 17) 
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Appendix B Moments of the Frechet distribution and normalization as a function of a. 



The moments of a Frechet distribution are obtained in Bury (1999). Here we wish to consider PDF 
of the form (19) which has extremum statistics 

PmiQ) = K{e---''r (Bl) 
where, following (25-32) we write: 

n = a + /31n(l + 2) (B2) 

Q 

where here we use the notations Q = AQ*, Q = Q*, that is, Q refers to extremal values. From 
(26), a and /? = {2k — 1) are constants. We can then define the moments of PmiQ)'- 

/oo 
. Q^dQPmiQ) (B3) 
-Q 

since from B2u ^ ooasQ ^ oo and u — oo as Q ^ —Q. Using the substitution ae" = C we 
obtain after some algebra 

Mn = KQ"" / - 1)" C»-i+V/3 e-C d( (B4) 

Jo 9 

where the constants 

KO 

g = ae" and K = — (B5) 

By taking the expansion u = a + (5Q/Q it is straightforward to verify that B4 yields the results 
from Appendix A. We now insist that Mq = 1, Mi = and M2 = 1. 
B4 then gives 

Mo = 1 = KT{a) where a = a + 1//? (B6) 
and 

M, = o = ifQ|n2_LiM_r(s)l 

that is 

V{a+^)=g^/^T{a) (B7) 
and using B7 we have from B4: 

17 



and since 



that is 

-2rr(a)r(a + 2/^_ 

using B6. 

Now from the main text (27) a 

(3 = -{2k -1) 
a = a + 1/13 = I 



and r(a) = r(l) = 1. 
B7 then gives 5^/^ = r(l + 1//3). B8 then gives Q: 



g = ± — 

r(i + |)-r2(i+ 1; 

then B7 gives K as 

/?a«r(l + 1/(5) 



K = ±- 



Q 



and since g = ae", B6 gives an expression for a: 
KQ 



that is: 



a 



-/31n 



r(l + 



which completes the normalization of B1,B2 as functions of k or a. 
Using B7 we have from B4 an expression for the third moment: 

T(a + |)r3(a) 3r(a + |)r2(a) 3r(a + i)r(a) 



M3 = KQ' 



+ 



V{a) 



r3(a + i) r2(a + i) ' r(a + i) 

Expansion in 1//3 readily shows that to lowest order result A17 is recovered. 

Then using B9, BlOandBll, B13 can be rearranged to give M^{(5), and hence M3 
of k or a: 

r(i + 1) - 3r(i + §)r(i + + 2t\i + 1)" 



r(i + i; 
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Figure Captions 



Fig. 1. Curves of the form (1) for a = 1, 7r/2, 2. Overlaid (*) are the numerically calculated extremal 
statistics of an uncorrelated Gaussian process (see text), and inset for comparison are Frechet curves plotted 
on the same scale (see Fig. 2). 



Fig. 2. Frechet PDF normaUzed to the first two moments for PDF N{Q) = + Q'^)'', k = 2,5, 100. 



Fig. 3. The peak (a) and its location (b) as a function of k for Frechet curves. 



Fig. 4. The third moment as a function of a for (a) curves of form (1) and (b) Frechet curves. 
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